Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retry to use Arc<Statistic> in PartitionedFile again #11894

Closed
wants to merge 6 commits into from

Conversation

Rachelint
Copy link
Contributor

Which issue does this PR close?

Closes #11885

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

@github-actions github-actions bot added the core Core DataFusion crate label Aug 8, 2024
@alamb
Copy link
Contributor

alamb commented Aug 8, 2024

See also #11893

@@ -391,6 +391,12 @@ impl ParquetExecBuilder {
&projected_output_ordering,
&base_config,
);

base_config
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

that is an interesting idea

Copy link
Contributor Author

@Rachelint Rachelint Aug 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

that is an interesting idea

But it still seem to make some long queries slower...

After finish the refactor work about blocked accumulators sketch. I plan to try to reduce the cost of the expansive clone scalar values like ScalarValue::String(String -> Arc<str>), and see if it can faster the shorts and don't slower the longs...

Copy link

Thank you for your contribution. Unfortunately, this pull request is stale because it has been open 60 days with no activity. Please remove the stale label or comment or this will be closed in 7 days.

@github-actions github-actions bot added the Stale PR has not had any activity for some time label Oct 15, 2024
@github-actions github-actions bot closed this Oct 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Core DataFusion crate Stale PR has not had any activity for some time
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Use Arc<Statistics> rather than Statistics in PartitionedFile
2 participants